The coronavirus nucleocapsid (N) protein is a multifunctional viral gene product that encapsidates the RNA 
genome and also plays some as yet not fully defined role in viral RNA replication and/or transcription. A 
number of conserved negatively charged amino acids are located within domain III in the carboxy end of all 
coronavirus N proteins. Previous studies suggested that the negatively charged residues are involved in virus 
assembly by mediating interaction between the membrane (M) protein carboxy tail and nucleocapsids. To 
determine the importance of these negatively charged residues, a series of alanine and other charged-residue 
substitutions were introduced in place of those in the N gene within a mouse hepatitis coronavirus A59 
infectious clone. Aspartic acid residues 440 and 441 were identified as functionally important. Viruses could 
not be isolated when both residues were replaced by positively charged amino acids. When either amino acid 
was replaced by a positively charged residue or both were changed to alanine, viruses were recovered that 
contained second-site changes within N, but not in the M or envelope protein. The compensatory role of the new 
changes was confirmed by the construction of new viruses. A few viruses were recovered that retained the 
D441-to-arginine change and had no compensatory changes. These viruses exhibited a small-plaque phenotype and 
produced significantly less virus. Overall, results from our analysis of a large panel of plaque-purified 
recovered viruses indicate that the negatively charged residues at positions 440 and 441 are key residues that 
appear to be involved in virus assembly. 


Coronaviruses are enveloped RNA viruses that cause respi- 
ratory and enteric infections in humans and many domesti- 
cated animals. Members of the family Coronaviridae contain 
single-stranded, positive-sense genomes that range from ap- 
proximately 27 to 31 kb in length. The viral genes are expressed 
through a discontinuous transcription mechanism that yields a 
nested set of subgenomic RNAs (39). Coronavirus virions con- 
tain at least three envelope proteins, membrane (M), spike (S), 
and envelope (E). The genomic RNA is encapsidated by the 
nucleocapsid (N) protein as a helical nucleocapsid (10, 26). 
The S protein is the viral receptor attachment protein that 
facilitates infection through fusion of viral and cellular mem- 
branes and is the major target of neutralizing antibodies during 
infection (15). The M protein is a major envelope component 
that plays an important role in virus assembly (11, 21, 31, 33, 
46). The E protein is a minor component of the viral envelope 
that also plays a critical role in virus budding. Coexpression of 
the E and M proteins is sufficient for formation of virus-like 
particles (3, 8, 46). Deletion of the E gene from the mouse 
hepatitis coronavirus (MHV-CoV) genome results in severely 
crippled virus (23), whereas elimination of expression of the 
gene in porcine transmissible gastroenteritis coronavirus 
blocks virus assembly (9, 34). Recently, the severe acute respi- 
ratory syndrome coronavirus (SARS-CoV) and MHV-CoV E 
proteins were shown to exhibit viroporin activity (25, 27, 48). 
Virions assemble at intracellular membranes of the endoplasmic 
reticulum-Golgi intermediate compartment (44). 

The focus of this report is the multifunctional N protein. 
The protein is a major structural component of virions that 
plays a role in virus assembly through interactions with the 
viral RNA and the M protein and through N-N interactions (13, 22, 28, 
30). Results from a number of studies suggest that N also 
plays a role in viral RNA synthesis (1, 2, 5, 7, 12, 45). 
Recovery of infectious cloned coronaviruses is increased 
when N protein transcripts are included during RNA trans- 
fection (1, 4, 49). Recent data strongly support earlier stud- 
ies by providing direct evidence that N is involved in viral 
RNA replication and/or transcription (40). N may also be 
involved in the translation of viral mRNAs (42). 

Coronavirus N proteins are phosphorylated. The proteins 
are highly basic, with isoelectric points (pI) of 10.3 to 10.7 (24). 
A three-domain structure for the protein has been proposed 
based on early sequence comparisons of MHV strains (35). 
The amino terminal and central domains of all coronavirus N 
proteins exhibit an overall positive charge, whereas the car- 
boxy-terminal domain is acidic. Conservation of negatively 
charged amino acids within the carboxy ends of all coronavirus 
N proteins suggests that the residues are functionally relevant. 
Furthermore, data from an earlier study suggest that the carboxyl 
end of the protein mediates interaction with the M protein 
during assembly, and the charged residues within the region 
were hypothesized to facilitate the interaction (22). 

FIG. 1. Schematic illustration of N negatively charged mutant constructs. An alignment of the negatively charged region in the carboxy ends 
of the N proteins of representative group II coronaviruses (MHV-CoV A59 [MHV], human coronavirus OC43, and bovine coronavirus [BCV]) 
and the newest member of the family, SARS-CoV (SARS), was generated using ClustalW (43). Terminal amino acid numbers are shown at the 
end of each sequence. Positively and negatively charged amino acid residues are indicated above the MHV and SARS sequences by plus and minus 
signs, respectively. A schematic of the three-domain model of the coronavirus N protein separated by the A and B spacer domains (35) is shown, 
with the carboxy-terminal amino acids expanded where mutations were introduced. The amino acid substitutions are indicated below the sequence 
of the extreme end of domain III. Recovery of mutant viruses is indicated by plus and minus signs at the left of each mutant virus designation. WT, 
wild type. 

Within the carboxy-terminal 22 amino acids of the MHV-CoV 
N protein there are eight negatively charged residues 
(Fig. 1). We therefore considered whether the charges are 
important for virus viability and output. The rationale for 
focusing on the residues was based on previous studies from our 
laboratory and others (13, 22), the conservation and concentra- 
tion of the negative charges, and the fact that charged residues 
can play an important role in helping to mediate protein-pro- 
tein interactions. Alanine and other charged-residue codon 
substitutions were introduced in place of the conserved negative 
charges in the N gene within an MHV-CoV A59 infectious 
clone. Results from the panel of mutant viruses identified 
aspartic acid residues 440 and 441 as functionally important. 
Maintenance of the net negative charge across the COOH end 
appears to drive the selection of new viruses when charge 
reversal substitutions are placed at these positions. These residues 
likely play roles in helping to mediate electrostatic interactions 
between N and other proteins, most likely the M protein, 
during virus assembly, but they may also contribute to 
other protein interactions that are important for the other 
functions of N. 


MATERIALS AND METHODS 


Cells and viruses. Stocks of wild-type MHV-CoV A59 and infectious cloned 
viruses were grown in mouse 17 clone 1 (17Cl1) cells. Titers were determined in 
mouse L2 cells. L2 and 17Cl1 cells were maintained in Dulbecco’s modified 
Eagle’s medium (DMEM) supplemented with 5 to 10% heat-inactivated fetal 
calf serum, as previously described (41). Baby hamster kidney (BHK) cells 
expressing the MHV Bgp1a receptor were kindly provided by Ralph Baric, 
University of North Carolina at Chapel Hill (50). The cells were maintained in 
Glasgow modified Eagle’s medium supplemented with 5% heat-inactivated fetal 
calf serum, 10% tryptose phosphate broth, glutamine, penicillin-streptomycin, 
and 80 µg/ml Geneticin (G418). 

J. VIROL. 

TABLE 1. Oligonucleotides used in this study 

Name                  Primer sequence^a                                           Use
D440R                 CAC TAC GCC ATC ACG AAG GAT CTG AGC CAA CAG                 Mutagenesis
D441R                 CAC TAC GCC ACG ATC AAG GAT CTG AGC CAA CAG                 Mutagenesis
DD440-441RR           CAC TAC GCC ACG ACG AAG GAT CTG AGC CAA CAG                 Mutagenesis
DD440-441AA           CAC TAC GCC AGC AGC AAG GAT CTG AGC CAA CAG                 Mutagenesis
DD440-441EE           CAC TAC GCC CTC CTC AAG GAT CTG AGC CAA CAG                 Mutagenesis
D446A                 CGT AGT GCC AGC TGG GTT AGA AG                              Mutagenesis
D451A                 CAC ATT AGA GGC ATC TTC TAA CCC ATC TGG CAC                 Mutagenesis
D451E                 CAC ATT AGA CTC ATC TTC TAA CCC ATC TGG CAC                 Mutagenesis
EDD449-451AAA         AAA CAC ATT AGA GGC AGC TGC TAA CCC ATC TGG CAC             Mutagenesis
R425G Forward         GCT CTG TGC AGC GAA ATG TAA GTG GCG AAT TAA CCC C           Mutagenesis top strand
R425G Reverse         CAG ACT TCT ATC CTC TGG GGT TAA TTC GCC ACT TAC ATT TCG C   Mutagenesis bottom strand
T428N D440R Forward   GCA GCG AAA TGT AAG TAG AGA ATT AAA CCC AGA GGA TAG AAG     Mutagenesis top strand
T428N D440R Reverse   GAG CCA ACA GAC TTC TAT CCT CTG GGT TTA ATT CTC TAC TTA C   Mutagenesis bottom strand
A436D D441R Forward   AAC CCC AGA GGA TAG AAG TCT GTT GGA CCA GAT CCT TGA TCG TG  Mutagenesis top strand
A436D D441R Reverse   CAC TAC GCC ACG ATC AAG GAT CTG GTC CAA CAG AC              Mutagenesis bottom strand
MHV M Reverse         CGG TAC CTT TCA TAT CTA TAC                                 Sequencing of E gene
MHV 4 Reverse         AGT CTG CTT TGG CTG ATT CCT TC                              Sequencing of M gene
MHV 6 Reverse         TTC CTG AGC CTG TCT ACG                                     Sequencing of M gene
MHV 7 Forward         ATT CTG GTG GTG CTG ATG AAC CGG C                           Sequencing of N gene
MHV 8 Forward         GGC AGA AGC TCC TCT GTA AAC C                               Sequencing of N gene

^a Mutagenized codons are boldface and underlined. 
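
As an illustrative cross-check of Table 1 (not part of the original methods), the following Python sketch reverse-complements three of the mutagenic primers and translates the result with the standard genetic code. It assumes that these primers are written antisense to the N coding strand and that the reading frame begins at the first base of the reverse complement. Both assumptions are inferred from the sequences themselves and are not stated in the table. The function names are hypothetical.

```python
from itertools import product

# Standard genetic code (NCBI translation table 1), codons ordered T, C, A, G.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}

# Primer sequences copied from Table 1.
PRIMERS = {
    "D440R": "CAC TAC GCC ATC ACG AAG GAT CTG AGC CAA CAG",
    "D441R": "CAC TAC GCC ACG ATC AAG GAT CTG AGC CAA CAG",
    "DD440-441RR": "CAC TAC GCC ACG ACG AAG GAT CTG AGC CAA CAG",
}


def reverse_complement(seq: str) -> str:
    """Return the reverse complement of an upper-case DNA string."""
    return seq.translate(str.maketrans("ACGT", "TGCA"))[::-1]


def translate(seq: str) -> str:
    """Translate complete codons in frame 0; trailing bases are ignored."""
    usable = len(seq) - len(seq) % 3
    return "".join(CODON_TABLE[seq[i:i + 3]] for i in range(0, usable, 3))


def encoded_peptide(primer: str) -> str:
    """Peptide encoded by the strand complementary to the primer
    (ASSUMPTION: the primer is antisense to the coding strand)."""
    return translate(reverse_complement(primer.replace(" ", "")))


for name, primer in PRIMERS.items():
    print(name, encoded_peptide(primer))
```

Under these assumptions each primer encodes an 11-residue stretch in which arginine appears at the seventh position, the eighth position, or both, which agrees with the primer names.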


Construction of amino acid substitution mutants. Plasmid pGEM-5Zf(-)M-N, a 
pGEM-5Zf(-) vector (Promega) containing the entire M and N genes (EcoRV-SacI 
fragment), was used as the template for mutagenesis of the N gene. The 
oligonucleotides used to perform site-directed mutagenesis are listed in Table 1. 
The original mutants were constructed using the GeneEditor site-directed 
mutagenesis system (Promega), basically according to the manufacturer’s 
instructions. The four reconstructed N double mutants were constructed by 
whole-plasmid PCR using Pfu polymerase (Stratagene) in the pGEM-5Zf(-)M-N 
background, which contained the previously modified D440R or D441R mutation. 
Following an initial incubation at 95°C for 3 min, the following conditions were 
applied for 18 cycles: 95°C for 15 s, 74°C for 60 s, and 68°C for 12 min. The PCR 
products were incubated at 37°C for 2 h with DpnI to destroy methylated 
template DNA before transformation into Escherichia coli DH5α. The entire 
EcoRV-SacI region of the N gene was confirmed by sequencing. Following 
confirmation of the mutations in pGEM-5Zf(-)M-N, the N gene was subcloned 
into the MHV G clone using NheI and SacI restriction sites. 
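
For orientation, the whole-plasmid PCR program described above can be written as a small data structure and its nominal hold time summed. This is a minimal sketch: the class and function names are hypothetical, and the total ignores ramp times between temperatures and the separate 2-h DpnI digestion.

```python
from dataclasses import dataclass


@dataclass(frozen=True)
class Step:
    temp_c: int    # hold temperature in degrees Celsius
    seconds: int   # hold time in seconds


# Values taken from the cycling conditions stated in the text.
INITIAL = Step(temp_c=95, seconds=3 * 60)
CYCLE = [
    Step(temp_c=95, seconds=15),
    Step(temp_c=74, seconds=60),
    Step(temp_c=68, seconds=12 * 60),
]
N_CYCLES = 18


def total_seconds(initial: Step, cycle: list, n_cycles: int) -> int:
    """Nominal hold time of the program, excluding ramping."""
    return initial.seconds + n_cycles * sum(step.seconds for step in cycle)


total = total_seconds(INITIAL, CYCLE, N_CYCLES)
print(f"{total} s = {total / 60:.1f} min")  # 14490 s = 241.5 min
```

The 12-min extension dominates the program, so the run takes roughly 4 h before any ramping overhead.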

Generation of mutant viruses. Mutant viruses were generated using the MHV-CoV 
A59 infectious clone kindly provided by Ralph Baric, University of North 
Carolina at Chapel Hill (50). Full-length cDNA clones were assembled as 
previously described with a few modifications. Plasmids containing the cDNA 
cassettes spanning the MHV genome were purified using QIAfilter Maxi cartridges 
(QIAGEN) and digested with MluI and Esp3I for fragment A; BglI and Esp3I 
for fragments B and C; Esp3I for fragments D, E, and F; and SfiI and Esp3I for 
fragment G. The fragments were gel purified and ligated overnight in a reaction 
volume of 100 to 200 µl. The ligated DNA products were extracted with 
phenol-chloroform and ethanol precipitated. RNA transcripts were made using the 
mMessage mMachine T7 Transcription reagents and protocol (Ambion). 
Transcription reactions were carried out at 37°C for 2 h in the presence of 
additional GTP. The MHV nucleocapsid gene was transcribed from pMHV-A59 
N (6) using T7 RNA polymerase. N gene transcripts were polyadenylated using 
Ambion’s poly(A) tailing system. 

Full-length MHV infectious-clone and N transcripts were electroporated into 
BHK cells stably expressing the MHV receptor (BHK-MHVR cells). BHK-MHVR 
cells had been seeded the previous day to reach approximately 70% 
confluence for electroporation. Cells were trypsinized, washed with 
phosphate-buffered saline, and resuspended at a concentration of 10^7 cells/ml in OptiMEM 
(Invitrogen). RNA transcripts were electroporated into 800 µl of the cell 
suspension in a 4-mm-gap cuvette with three electrical pulses of 850 V at 25 µF 
using a Bio-Rad Gene Pulser II electroporator. Transfected cells were seeded in 
75-cm^2 flasks and incubated at 37°C. The cells were monitored for syncytia. The 
medium and cells were harvested 24 to 48 h after electroporation. 

TABLE 1 (continued) 

Name                  Primer sequence                                             Use
MHV EM (+)            CAG AAC TGT CCA ACA GGC CGT TAG CAA G                       RT-PCR (+) strands
MHV EM (-)            GCA ACC CAG AAG ACA CCT TCA ATG C                           RT-PCR (-) strands
MHV MN (+)            CCA CCT CTA CAT GCA AGG TGT TAA GC                          RT-PCR (+) strands
MHV MN (-)            GGT CTG CCA CAA CCT TCT CTA TCT                             RT-PCR (-) strands

An aliquot of the frozen stock from the electroporated cells was used to infect 
L2 cells. The media were harvested from the infected cells at approximately 24 h 
p.i. Total RNA was extracted from cells that remained adhered to the flasks using 
Ambion’s RNAqueous-4PCR extraction system. The extracted RNA was treated 
with DNase I for 15 min at 37°C. RNA was reverse transcribed using Invitrogen’s 
Superscript reverse transcription (RT)-PCR and recommended protocol. The 
initial reaction mixture was incubated at 42°C for 2 min prior to reverse tran- 
scription for 50 min. The reactions were terminated by incubation at 70°C for 15 
min. Template RNA was destroyed by additional incubation for 20 min at 37°C 
after the addition of RNase H. The RT product was subjected to 30 cycles of 
PCR amplification using Ambion’s SuperTaq Plus. Following an initial incuba- 
tion of 95°C for 5 min, the following conditions were applied: 95°C for 30 s, 59°C 
for 30 s, and 72°C for 90 s, followed by a final 10-min extension at 72°C. PCR 
products were cleaned up using QIAGEN’s MinElute columns and sequenced 
directly. 

Ten to 20 plaques were subsequently isolated from the electroporated cell/ 
medium virus stock. The plaques were passaged onto L2 cells. RNA was ex- 
tracted from the infected cells at approximately 24 h p.i. RT-PCR was per- 
formed, and the entire E, M, and N genes were sequenced. Selected plaques 
were amplified on L2 cells through five passages, at which time the sequences of 
the E, M, and N genes were again confirmed. 

Growth kinetics. Growth kinetics experiments were carried out in 17Cl1 cells 
infected with P5 virus stocks at a multiplicity of infection (MOI) of 5 or 1. Cell 
culture supernatants were collected at various times after infection. Titers were 
determined by plaque assay on L2 cells. At approximately 48 h p.i. the agarose/ 
medium overlays were removed and the cells were fixed and stained with crystal 
violet. 
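
The arithmetic behind a plaque assay titer and an inoculum at a chosen MOI can be sketched as follows. The formulas are the standard ones. All numbers in the example (plaque count, dilution, plated volume, cell number, stock titer) are hypothetical and are not taken from this study, and the function names are illustrative only.

```python
def titer_pfu_per_ml(plaques: int, dilution: float, volume_ml: float) -> float:
    """Titer = plaques counted / (dilution factor x volume plated in ml)."""
    return plaques / (dilution * volume_ml)


def inoculum_volume_ml(moi: float, n_cells: float, titer: float) -> float:
    """Volume of stock needed so that PFU added / cells equals the MOI."""
    return moi * n_cells / titer


# Hypothetical example: 42 plaques from 0.5 ml of a 10^-6 dilution.
stock_titer = titer_pfu_per_ml(plaques=42, dilution=1e-6, volume_ml=0.5)
print(f"titer = {stock_titer:.2e} PFU/ml")

# Hypothetical example: MOI of 5 on 10^6 cells with a 10^8 PFU/ml stock.
volume = inoculum_volume_ml(moi=5, n_cells=1e6, titer=1e8)
print(f"inoculum = {volume * 1000:.0f} µl")
```

The same relation shows why a stock whose titer is several orders of magnitude lower cannot be used at a high MOI without concentrating the virus first.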

Northern blotting. Monolayers of 70 to 80% confluent 17Cl1 cells in 60-mm-diameter 
dishes were individually infected with wild-type MHV and 
selected mutant viruses at an MOI of 0.1. RNA was extracted at 8 and 12 h 
p.i. with TRIzol (Invitrogen) according to the manufacturer’s instructions. 
The RNA pellets were resuspended in RNase-free water. Equivalent amounts 
of total intracellular RNA were denatured and separated on 1% agarose gels 
containing formaldehyde at 85 V for 5 h essentially as described previously 
(41). After electrophoresis, the gels were vacuum blotted onto positively 
charged nylon membranes in 20× SSC (3 M NaCl, 0.3 M sodium citrate). 
Northern blotting analyses were performed using a digoxigenin-labeled 357- 
nucleotide MHV-CoV A59 N gene probe. The probe was transcribed using 
reagents for digoxigenin labeling (Roche Applied Science) according to the 
manufacturer’s directions. Images were quantified by densitometric scanning 
of the fluorograms and analyzed using ImageQuant software (Molecular 
Dynamics). 

FIG. 2. Growth kinetics and plaque morphologies of D446A, D451A, EDD449-451AAA, and D451E mutant viruses. (A) Single-step growth kinetics 
of mutant viruses relative to wild-type (WT) virus. Mouse 17Cl1 cells were infected with the wild-type virus and mutant viruses D446A, D451A, and 
EDD449-451AAA at an MOI of 5 or with wild-type and D451E viruses at an MOI of 1. Titers were measured by plaque assay in L2 cells at the 
indicated times. The error bars indicate the average deviations from the mean for two independent growth kinetic experiments. (B) Plaque sizes 
and morphologies of recovered mutant viruses in infected mouse L2 cells are shown. 


Metabolic labeling and analysis of proteins in infected cells. Monolayers of 70 
to 80% confluent 17Cl1 cells in 35-mm-diameter plates were infected with wild- 
type or selected mutant viruses at an MOI of 0.1 PFU/cell. At 4.5 h p.i., the cells 
were starved for 30 min with DMEM without methionine and cysteine prior to 
being labeled with 125 µCi/ml of EXPRE35S35S protein labeling mix (PerkinElmer) 
for 30 min. Immediately after labeling, the cytoplasmic fraction was 
harvested from one set of plates. A parallel set of plates were washed and refed 
with DMEM containing 10 times the normal amount of methionine and cysteine 
and chased until 9.5 h p.i., at which time both cytoplasmic lysates and the 
extracellular medium fractions were harvested. The cells were washed twice with 
cold phosphate-buffered saline and lysed with RIPA lysis buffer (1% Triton 
X-100, 1% deoxycholate, 0.3% sodium dodecyl sulfate [SDS], 150 mM NaCl, 50 
mM Tris-HCl, pH 7.6, 20 mM EDTA) containing 1× Complete, Mini, EDTA-free 
Protease Inhibitor Cocktail (Roche Applied Science). Lysates and 
media were clarified at 16,000 × g for 10 min. Virions in the extracellular media 
were lysed by the addition of an equal volume of 2× RIPA lysis buffer and 
sonication for 1.5 min at 30-s intervals with 30-s rests. Both intracellular and 
extracellular fractions were precleared by incubation with protein A-Sepharose 
at 4°C for 1 h with rocking. Viral proteins were immunoprecipitated by incuba- 
tion with rabbit anti-MHV antibody F88 (K. Holmes, University of Colorado 
Health Sciences Center) overnight at 4°C. Immunoprecipitated protein complexes 
were isolated by incubation with protein A-Sepharose for 2 h at 4°C with 
constant rocking. The immunoprecipitates were washed five times with RIPA buffer 
prior to elution in SDS-polyacrylamide gel electrophoresis (PAGE) sample 
buffer by heating them at 95°C for 5 min and were analyzed by SDS-PAGE on 
5 to 20% gradient gels. Prior to being dried, the gels were incubated for 30 min 
at room temperature with Amplify Fluorographic Reagent (GE Healthcare Life 
Sciences). Proteins were detected by fluorography. Protein products were quan- 
tified by densitometric scanning of the fluorograms and analyzed using Image- 
Quant software (Molecular Dynamics). 

Analysis of N protein isoelectric points. Prediction of the isoelectric points 
of the wild-type and mutant N proteins was performed using the ScanSite pI/Mw 
program algorithm, with the option to include phosphorylation sites, from 
ExPASy’s proteomics server at the Swiss Institute of Bioinformatics 
(http://ca.expasy.org/) (16). 
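
To make the idea of a predicted isoelectric point concrete, the sketch below computes the net charge of a peptide from the Henderson-Hasselbalch equation and finds the pH of zero net charge by bisection. It is not the ScanSite or ExPASy implementation. The pKa values are a generic textbook-style set chosen for illustration, phosphorylation is not modeled, and the short peptides are hypothetical stand-ins rather than the full N protein sequence.

```python
# ASSUMPTION: illustrative pKa values; real tools use their own calibrated sets.
PKA_POSITIVE = {"Nterm": 9.0, "K": 10.5, "R": 12.5, "H": 6.0}
PKA_NEGATIVE = {"Cterm": 2.0, "D": 3.9, "E": 4.1, "C": 8.3, "Y": 10.1}


def net_charge(seq: str, ph: float) -> float:
    """Net charge of a peptide at a given pH (Henderson-Hasselbalch)."""
    charge = 1.0 / (1.0 + 10 ** (ph - PKA_POSITIVE["Nterm"]))
    charge -= 1.0 / (1.0 + 10 ** (PKA_NEGATIVE["Cterm"] - ph))
    for aa in seq:
        if aa in PKA_POSITIVE:
            charge += 1.0 / (1.0 + 10 ** (ph - PKA_POSITIVE[aa]))
        elif aa in PKA_NEGATIVE:
            charge -= 1.0 / (1.0 + 10 ** (PKA_NEGATIVE[aa] - ph))
    return charge


def isoelectric_point(seq: str, tol: float = 1e-4) -> float:
    """Bisection on pH 0 to 14; net charge decreases monotonically with pH."""
    lo, hi = 0.0, 14.0
    while hi - lo > tol:
        mid = (lo + hi) / 2.0
        if net_charge(seq, mid) > 0:
            lo = mid
        else:
            hi = mid
    return (lo + hi) / 2.0


# Hypothetical 11-residue peptides differing only at two adjacent positions.
for label, peptide in [("DD", "LLAQILDDGVV"),
                       ("RD", "LLAQILRDGVV"),
                       ("RR", "LLAQILRRGVV")]:
    print(label, round(isoelectric_point(peptide), 2))
```

In this toy calculation each aspartate-to-arginine replacement raises the predicted pI, which is the direction of change expected for charge-reversal substitutions.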


RESULTS 


FIG. 3. Growth kinetics and plaque morphologies of DD440-441AA, DD440-441EE, and D441R no. 8 viruses. (A) Mouse 17Cl1 cells were infected 
with wild-type (WT), DD440-441AA, and DD440-441EE viruses at an MOI of 5. Single-step growth kinetics analysis was performed by plaque assay 
in mouse L2 cells. The error bars indicate average deviations of two independent growth kinetics experiments. (B) Plaque sizes and morphologies 
of DD440-441AA, DD440-441EE, and D441R no. 8 viruses analyzed in parallel with wild-type virus in mouse L2 cells are shown. 

Generation of N charged-residue mutant viruses. To begin 
examining the importance of the conserved negatively charged 
residues within the carboxy-terminal end of the MHV-CoV N 
protein, substitutions were introduced within the last 15 amino 
acids (Fig. 1). Negatively charged aspartic acid (D) residues at 
positions 440 and 441 were changed either singly to positively 
charged arginine (R) or doubly to R, negatively charged glu- 
tamic acid (E), or neutrally charged alanine (A). Single alanine 
substitutions were introduced in place of D residues at positions 
446 and 451. D451 was also replaced by a glutamic acid 
residue. Additionally, a triple mutation was introduced at po- 
sitions 449 to 451 in which the EDD residues were all changed 
to alanines. In all cases, 2 nucleotides were introduced to alter 
the individual amino acid codons, thus significantly decreasing 
the chances of reversions to allow us to study compensatory 
second-site changes. 

All mutations were generated by site-directed mutagenesis and 
studied in the context of a full-length MHV-CoV A59 infectious 
clone (50). All full-length mutant RNAs yielded cytopathic effects 
characterized by centers of fusion following electroporation into 
BHK-MHVR cells. However, viable viruses were subsequently 
recovered for only eight of the nine mutants after passage onto 
new cells (Fig. 1). Mutant DD440-441RR exhibited a few centers of 
fusion after electroporation, but multiple attempts to passage the 
virus were unsuccessful. The eight viable mutant viruses were 
plaque purified and analyzed for their phenotypic and genetic 
characteristics through multiple passages. RT-PCR and sequence 
analysis of the N, M, and E genes before and after multiple 
passages were carried out to determine the genetic stability of the 
viruses. 

Negatively charged residues 446 and 449 to 451 are not 
absolutely required. Analysis of the recovered viruses with 
alanine replacements for D446 and D451 and a triple substitution 
for EDD449-451, as well as a glutamic acid substitution for 
residue 451, indicated that the viruses were phenotypically like 
the wild-type parental virus (Fig. 2). The plaque size and morphology 
of the viruses were indistinguishable from those of the 
wild type. Recovered viruses were plaque purified, and multiple 
plaques of each mutant were followed for five passages. 
Sequence analysis of P5 confirmed the stability of the intro- 
duced mutations and that no additional changes were present 
in the remainder of the N gene or within the E or M gene. 
Growth kinetics analysis demonstrated that all of the recov- 
ered viruses grew like the parental virus (Fig. 2). These results 
strongly suggest that the negative charges at the carboxyl end 
of the protein are not absolutely required for the protein to 
carry out its function(s). 

The conserved negative charges at positions 440 and 441 are 
very important. The removal of the negative charges at positions 
D440 and D441 had a strikingly different effect on the virus compared 
with the mutants described above. While centers of fusion 
were observed for the five mutants that encompassed these residues, 
only four viruses were subsequently recovered (Fig. 1). 
Multiple attempts to recover the DD440-441RR virus were not 
successful, indicating that the replacement of the negative charges 
with positive charges is detrimental to the virus. This strongly 
suggested that these are functionally important residues. 

FIG. 4. Summary of sequence analysis of recovered viruses from D440R, D441R, and DD440-441AA mutants. (A) Sequence analysis of the E, M, 
and N genes of multiple plaques from viruses with substitutions at amino acid positions 440 and/or 441 revealed additional changes only within 
the N gene. The bars represent the number of plaques analyzed that exhibited the additional changes indicated below each bar. Changes are listed 
as the wild-type residue at each position in the N protein and the corresponding change that was identified. (B) The five additional amino acid 
changes identified most frequently in the D440R, D441R, and DD440-441AA mutant viruses are summarized, with the positions of the additional 
changes indicated below the wild-type (WT) sequence. Amino acid positions and charged residues are indicated by numbers and plus/minus signs, 
respectively. 

The apparent importance of the DD440-441 residues was further 
strengthened as the recovered viruses were analyzed. 
First, replacement of both residues with negatively charged 
glutamic acid (E) yielded virus that displayed plaque size, 
morphology, and growth characteristics similar to those of 
wild-type virus and no additional changes in the E, M, or N 
gene (Fig. 3). On the other hand, replacement of both negative 
charges with alanine residues gave rise to DD440-441AA viruses 
that retained the introduced substitutions but also had addi- 
tional changes in which serine and arginine residues at posi- 
tions 424 and 425 in the N protein were replaced by glycines 
(Fig. 3). These changes appeared quickly following electropo- 
ration, since relatively little cytopathic effect was observed dur- 
ing the first 24 h following electroporation, but a significant 
increase in fusion was present by 48 h. Sequence analysis prior 
to plaque purification indicated that the recovered virus con- 
sisted of a mixed population at positions 424 and 425 (data not 
shown). Seven plaques were selected and analyzed following 
five passages. All retained the original alanine substitutions 
and the additional glycine changes at positions 424 and 425. 
The plaque-purified viruses all exhibited plaque size, morphology, 
and growth kinetics indistinguishable from those of the 
wild-type virus (Fig. 3). Taken together, the data strongly indicate 
that negative charges are favored at positions 440 and 
441, but they are not absolutely required. The results suggested 
that, even though neutral residues at these positions are tolerated, 
compensating changes appeared to be necessary for recovery 
of viable virus. The seemingly exponential growth of the 
DD440-441AA virus following electroporation suggested that 
there is strong selective pressure for the virus to compensate 
for the introduced changes. 

FIG. 5. Plaque morphologies and growth kinetics of reconstructed N mutant viruses with compensating changes. (A) Mouse 17Cl1 cells were 
infected with wild-type (WT) infectious cloned virus or R425G D440R, T428N D440R, R425G D441R, or A436D D441R reconstructed virus at an MOI 
of 5. Growth kinetics properties were analyzed by plaque assay in L2 cells. The error bars indicate average deviations of two independent growth 
kinetics experiments. (B) Plaque sizes and morphologies of the compensated viruses relative to the wild-type virus in infected L2 cells are shown. 

Interestingly, unlike the double arginine substitution for the 
DD440-441 residues, viruses were recovered with the substitution 
of a single positively charged arginine at either position. 
However, the D440R and D441R viruses exhibited a variety of 
genotypes (Fig. 4). All of the plaque-purified viruses for both 
the D440R and D441R viruses retained the introduced arginine, 
with one exception. One wild-type revertant was isolated from 
the D441R mutant. Approximately one-quarter of the analyzed 
plaques from the D441R mutant had no additional changes, 
whereas all of the recovered D440R viruses exhibited additional 
changes in the N gene. The D441R viruses that exhibited no 
additional changes were genetically stable through five passages 
if allowed to grow for a short period, but the viruses were 
very crippled. They grew to titers 3 to 4 orders of magnitude 
lower than those of the wild-type parent and exhibited a 
small-plaque phenotype (Fig. 3). 

The majority of the additional changes for both the D440R 
and D441R mutants were concentrated in the region between 
residues 424 and 436 in the N protein (Fig. 4). All of the 
recovered viruses with changes in this region exhibited 
wild-type phenotypes. Most of the recovered plaque-purified viruses 
for both mutants had an arginine-to-glycine change at position 
425. The frequency of the changes strongly suggested that they 
might be compensating mutations that provided the mutant 
viruses with a significant growth advantage. Analysis showed 
that D440R plaques with either the R425G or the T428N change 
and D441R plaques with either the R425G or the A436D change 
were indistinguishable from wild-type virus in terms of plaque 
size, morphology, and growth characteristics. 
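
As a simple bookkeeping aid, the sketch below tallies the change in formal side-chain charge, relative to the wild type, for the recovered genotypes named in the text. It assumes charges of -1 for D and E, +1 for K and R, and 0 for every other residue (histidine included) at neutral pH. It says nothing about structure or about why a given change is selected, and the names used are hypothetical.

```python
# ASSUMPTION: formal side-chain charges at neutral pH; all other residues are 0.
FORMAL_CHARGE = {"D": -1, "E": -1, "K": +1, "R": +1}


def charge_delta(substitution: str) -> int:
    """Charge change for a substitution written like 'D440R'."""
    wild_type, mutant = substitution[0], substitution[-1]
    return FORMAL_CHARGE.get(mutant, 0) - FORMAL_CHARGE.get(wild_type, 0)


def net_delta(substitutions: list) -> int:
    """Summed charge change for a set of substitutions in one protein."""
    return sum(charge_delta(s) for s in substitutions)


# Genotypes as described in the text.
GENOTYPES = {
    "D440R": ["D440R"],
    "D440R + R425G": ["D440R", "R425G"],
    "D440R + T428N": ["D440R", "T428N"],
    "D441R + R425G": ["D441R", "R425G"],
    "D441R + A436D": ["D441R", "A436D"],
    "DD440-441AA + S424G + R425G": ["D440A", "D441A", "S424G", "R425G"],
}

for name, subs in GENOTYPES.items():
    print(f"{name}: {net_delta(subs):+d}")
```

By this count the R425G and A436D changes each offset one of the two charge units gained with a D-to-R substitution, whereas T428N leaves the formal charge unchanged, so charge bookkeeping alone would not account for that change.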

Other changes within the N protein compensate for mutations 
in the D440R and D441R recovered viruses. To confirm 
which of the additional changes in the recovered viruses were 
providing a compensatory growth advantage, four independent 
viruses were constructed. We focused on the changes that were 
recovered most frequently, R425G coupled with D440R or 
D441R, T428N in combination with D440R, and A436D with 
D441R (Fig. 5). Each of the selected mutations was introduced, 
and new infectious clones were assembled to ensure that no 
other incidental changes elsewhere in the genome were 
providing a growth advantage to the virus. 

Viruses were recovered from all of the newly constructed 
clones. The viruses were genetically stable, as demonstrated by 
sequence analysis of multiple independent plaques. The intro- 
duced amino acid substitutions were retained, and no other 
changes arose through the multiple passages. All four recon- 
structed mutant viruses displayed growth and plaque size/mor- 
phology indistinguishable from those of the wild-type virus 
(Fig. 5), thus confirming that the new changes most frequently 
present in the recovered viruses with the original mutations 
were indeed responsible for the growth advantage of the vi- 
ruses. 

FIG. 6. Macromolecular synthesis of crippled D441R no. 8 compared with wild-type virus and recovered viruses with second-site compensating 
changes. Mouse 17Cl1 cells were infected at an MOI of 0.1. (A) Analysis of protein synthesis and extracellular virus. Proteins were metabolically 
radiolabeled with [35S]methionine and cysteine for 30 min at 5 h p.i. (lanes 1 to 4) and chased for 4 h (lanes 5 to 12). Intracellular viral proteins 
(lanes 1 to 8) and extracellular virions (lanes 9 to 12) were immunoprecipitated and analyzed by SDS-PAGE and autoradiography (top). Molecular 
weights in thousands are shown on the left, and the positions of the viral proteins are indicated on the right. Quantification of S, M, and N proteins 
was obtained by densitometry. The density values for intracellular viral proteins at 5.5 h p.i. were summed and are expressed relative to the same 
sum for the wild-type virus (bottom, Intracellular). Corresponding density values for the extracellular viruses at 9.5 h p.i. were normalized to the 
intracellular viral-protein sums at 5.5 h p.i. and plotted relative to the wild type (WT) (bottom, Extracellular). (B) Total intracellular RNA was 
extracted and analyzed by Northern blotting with a probe specific for the N gene (top). The numbers appended to the mutant names refer to 
specific plaque numbers of recovered viruses as described in the text. Density values for the sum of the four smallest subgenomic RNAs were 
expressed relative to the sum for the wild-type virus (bottom). Density values were obtained from Northern blot analysis similar to the one shown 
but exposed for a shorter time (bottom). The data represent average values from two experiments for both protein and RNA analyses. 

Effects of the D441R mutation on viral protein and RNA 
levels. We were able to recover only a few D441R and no D440R 
mutant viruses that did not have additional compensating 
changes. The D441R virus was clearly crippled, since it exhibited 
a small-plaque phenotype (Fig. 3), slow growth, and very 
low titer. Working with the virus was difficult because of the 
low titer and the appearance of compensating changes that 
often arose when the virus was grown through multiple replication 
cycles. Nonetheless, we were able to carry out limited 
analysis of this virus, which provided some insight into how the 
mutant protein impacts virus growth. Initially, mouse L2 cells 
were infected at an MOI of 0.0001, and the amount of released 
virus was analyzed by plaque assay. No N441R mutant virus was 
detected at 11 h p.i., and 100 to 500 times less virus was 
detected during the 16- to 27-h p.i. period compared with the 
compensated D441R virus at these times (data not shown). The 
slow growth was consistent with the small-plaque phenotype and 
the slow expansion of the virus that were observed in the initial 
infections with the N441R mutant viruses that were recovered 
without any compensating changes. 

To determine if the small plaque size and reduced growth
correlated with the amount of viral macromolecular synthesis,
viral protein and subgenomic RNA syntheses were examined.
Mouse 17Cl1 cells were infected at an MOI of 0.1. Pulse-chase
experiments revealed that at 5.5 h p.i., the overall viral protein
synthesis for the D441R no. 8 mutant was approximately 50% of
that observed with the wild-type virus and the recovered D441R
no. 4 that contained the R425G compensating change. However,
after a 4-h chase, the amount of extracellular D441R no. 8 virus
was less than 10% of what was measured for the wild-type and
D441R no. 4 viruses (Fig. 6A).
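
The normalization described in the legend to Fig. 6A can be sketched in a few lines of Python. This is only an illustration of the arithmetic, not the analysis software used in the study. The band densities are invented placeholder values, the function names are hypothetical, and summing the extracellular S, M, and N bands before normalization is an assumption.

```python
# Placeholder band densities in arbitrary units. These numbers are invented
# for illustration and are NOT measurements from the study.
intracellular = {
    "WT": {"S": 100.0, "M": 200.0, "N": 300.0},
    "mutant": {"S": 50.0, "M": 100.0, "N": 150.0},
}
extracellular = {
    "WT": {"S": 30.0, "M": 60.0, "N": 90.0},
    "mutant": {"S": 2.0, "M": 3.0, "N": 4.0},
}


def band_sums(densities):
    """Sum the S, M, and N band densities for each virus."""
    return {virus: sum(bands.values()) for virus, bands in densities.items()}


def relative_intracellular(intra, reference="WT"):
    """Intracellular sums expressed relative to the reference virus."""
    sums = band_sums(intra)
    return {virus: total / sums[reference] for virus, total in sums.items()}


def relative_extracellular(intra, extra, reference="WT"):
    """Extracellular sums divided by the matching intracellular sums,
    then expressed relative to the reference virus."""
    intra_sums = band_sums(intra)
    extra_sums = band_sums(extra)
    ratios = {v: extra_sums[v] / intra_sums[v] for v in extra_sums}
    return {v: ratio / ratios[reference] for v, ratio in ratios.items()}


if __name__ == "__main__":
    print(relative_intracellular(intracellular))
    print(relative_extracellular(intracellular, extracellular))
```

Dividing the extracellular signal by the intracellular sum separates a release defect from a simple reduction in protein synthesis, which is the distinction the pulse-chase comparison relies on.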

The viral RNA levels were also analyzed, since the N protein 
apparently plays some role in viral RNA synthesis (40). Cells 
were infected with wild-type virus and recovered viruses D441R
no. 1, D441R no. 4, DD440-441AA no. 3 (all with the compensatory
glycines at positions 424 and/or 425), and D441R no. 8.
Intracellular RNA was harvested and analyzed by Northern
blotting at 12 h p.i. using an N gene-specific probe. All of the
recovered viruses, including the crippled D441R no. 8 virus,
displayed the characteristic nested set of subgenomic RNAs
(Fig. 6B). This indicated that the D441R no. 8 virus is
transcriptionally functional, but the amount of intracellular
subgenomic RNA was reduced by approximately one-third compared
with the wild-type and D441R no. 4 viruses. In spite of
the differences in subgenomic RNA amounts, we conclude,
based on the level of protein synthesis and the virus output
levels, that the mutant virus is not assembled as efficiently as
the wild-type virus and compensated D441R no. 4 virus.

Mutations are predicted to impact the overall charge of 
domain III. A proteomics approach was taken to gain insight 
into how the various charge substitutions might impact the N 
protein. We used ScanSite pI/Mw, with the option to include
phosphorylation sites, on the ExPASy proteomics server at the 
Swiss Institute of Bioinformatics to determine the predicted 
effect of the mutations and the compensating changes that 
were found in the recovered viruses on the theoretical pI of the 
N protein. The predicted pI for the wild-type protein is very 
basic, ranging between 9.76 for the unphosphorylated MHV- 
CoV A59 N protein and 9.38 if the protein is phosphorylated 
at as many as four sites. The individual changes that we intro- 
duced were predicted to result in small increases in the pI of 
the entire protein. If domain III and its concentration of con- 
served negative charges within the domain were functionally 
important for protein-protein interactions, at least part of the
domain would likely be exposed on the surface of the protein.
We reasoned that analysis of this domain alone might provide
insight into how our mutations could be disruptive and whether
the compensating changes that were observed would alleviate
changes in the overall charge of the domain. Thus, we focused
on domain III and its acidic subdomain, where the conserved 
negative charges are located (Table 2). 

There is one predicted phosphorylation site at position T428
in domain III. Preliminary data in our laboratory suggested
that the site is not phosphorylated on the virion-associated N,
but we had not yet determined if the site is phosphorylated on
the intracellular N protein (T. C. White and B. G. Hogue,
unpublished data). Therefore, we analyzed the domain both
with and without phosphorylation of the site. Replacement of
DD440-441 with positive charges was predicted to have the
greatest impact on the pI of domain III, with the most dramatic
effect on its acidic domain (Table 2). In most cases an increase
in the predicted pI correlated with a crippled phenotype of our
mutant viruses, and the pI was predicted to be shifted back
closer to that of the wild-type N in the mutants that were
recovered with compensating changes. In the case of the
DD440-441RR mutant, the pI for domain III was predicted to
increase from 4.14 to 4.81 if T428 was not phosphorylated,
whereas for its acidic subdomain, a change from 3.21 to 4.23
would result compared with the wild-type protein. In the most
frequently recovered viruses with compensating changes, the
overall charge of domain III was predicted to be less positive,
closer to that of the wild-type protein. For example, the
predicted pI of domain III of the D441R no. 4 virus would decrease
to 4.24 compared with 4.42 for the D441R no. 8 virus, which
lacked the R425G compensating change. This appears to be a
relatively small change, but it is consistent with what was
predicted for the viruses that were recovered with the introduced
mutations at D440 and singly within EDD449-451 that had no
other changes but grew like the wild type. The predictions are
consistent with our conclusion that DD440-441 are important in
helping to maintain the overall negative charge of the domain.

TABLE 2. Predicted isoelectric points(a) of wild-type and mutant N
protein domain III(b)

Virus          Domain III    Acidic domain   Recovered(g) domain III pI(d)
               pI(c,d)       pI(e)

WT(f)          4.14, 3.99    3.21
D440R          4.42, 4.24    3.71            4.24, 4.07 (R425G D440R)
                                             4.42 (T428N D440R)
D441R          4.42, 4.24    3.71            4.24, 4.07 (R425G D441R)
                                             4.30, 4.15 (A436D D441R)
DD440-441RR    4.81, 4.56    4.23
DD440-441AA    4.36, 4.17    3.37            4.16, 3.97 (SR424-425GG DD440-441AA)
DD440-441EE    4.22, 4.06    3.29
D440A          4.24, 4.07    3.28
D451A          4.24, 4.07    3.28
D451E          4.18, 4.02    3.25
EDD449-451AAA  4.46, 4.24    3.42

(a) The isoelectric points were calculated using the Compute pI/Mw
program on the ExPASy proteomics server at the Swiss Institute of
Bioinformatics, with the option to include phosphorylation sites.
(b) 406-FVDNVSVAKPKSSVQRNVSRELTPEDRSLLAQILDDGVVPDGLEDDSNV-454.
(c) Amino acids 406 to 454.
(d) Predicted pI values based on zero (left) or one (right)
phosphorylation site at T428.
(e) Amino acids S433 to V454.
(f) WT, wild type.
(g) Wild-type amino acids and numbers, followed by the second-site
amino acid changes (R425G, T428N, R425G, A436D, SR424-425GG) or
corresponding originally introduced mutations (D440R, D441R,
DD440-441AA) in recovered viruses are shown in parentheses.
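
The direction of the predicted pI shifts can be reproduced with a simple Henderson-Hasselbalch estimate. The sketch below is not the ExPASy or Scansite algorithm. It uses a generic set of approximate pKa values (an assumption), ignores phosphorylation, and takes the domain III sequence from footnote b of Table 2, so the absolute numbers will not match Table 2. The helper names are hypothetical.

```python
import re

# Domain III of the MHV A59 N protein (residues 406 to 454), transcribed from
# footnote b of Table 2. Residue 420 is read as Q here; it is uncharged, so it
# does not affect the estimate.
DOMAIN_III = "FVDNVSVAKPKSSVQRNVSRELTPEDRSLLAQILDDGVVPDGLEDDSNV"
FIRST_RESIDUE = 406

# ASSUMPTION: generic approximate pKa values, not the parameter set used by
# the ExPASy or Scansite tools. Phosphorylation at T428 is not modeled.
PKA_POSITIVE = {"K": 10.5, "R": 12.5, "H": 6.0}
PKA_NEGATIVE = {"D": 3.9, "E": 4.1, "C": 8.3, "Y": 10.1}
PKA_N_TERMINUS = 9.0
PKA_C_TERMINUS = 3.1


def mutate(seq, *changes, first=FIRST_RESIDUE):
    """Apply substitutions written like 'D441R' (N protein numbering)."""
    residues = list(seq)
    for change in changes:
        match = re.fullmatch(r"([A-Z])(\d+)([A-Z])", change)
        if match is None:
            raise ValueError(f"cannot parse substitution {change!r}")
        old, position, new = match.group(1), int(match.group(2)), match.group(3)
        index = position - first
        if residues[index] != old:
            raise ValueError(f"expected {old} at {position}, found {residues[index]}")
        residues[index] = new
    return "".join(residues)


def net_charge(seq, ph):
    """Net charge at a given pH from independent ionizable groups."""
    positive = 1.0 / (1.0 + 10 ** (ph - PKA_N_TERMINUS))
    negative = 1.0 / (1.0 + 10 ** (PKA_C_TERMINUS - ph))
    for aa in seq:
        if aa in PKA_POSITIVE:
            positive += 1.0 / (1.0 + 10 ** (ph - PKA_POSITIVE[aa]))
        elif aa in PKA_NEGATIVE:
            negative += 1.0 / (1.0 + 10 ** (PKA_NEGATIVE[aa] - ph))
    return positive - negative


def isoelectric_point(seq, tol=1e-4):
    """Bisection on pH; the net charge decreases monotonically with pH."""
    low, high = 0.0, 14.0
    while high - low > tol:
        mid = (low + high) / 2.0
        if net_charge(seq, mid) > 0:
            low = mid
        else:
            high = mid
    return (low + high) / 2.0


if __name__ == "__main__":
    variants = {
        "WT": (),
        "D441R": ("D441R",),
        "R425G D441R": ("R425G", "D441R"),
        "DD440-441RR": ("D440R", "D441R"),
    }
    for name, changes in variants.items():
        print(f"{name:12s} pI ~ {isoelectric_point(mutate(DOMAIN_III, *changes)):.2f}")
```

Because the estimate depends only on amino acid composition, D440R and D441R give the same value, which is consistent with the identical entries for these two mutants in Table 2.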


DISCUSSION 


In this study, we examined the importance of negatively 
charged amino acids located within the carboxy-terminal 22 
amino acids of the MHV-CoV A59 N protein. We found that 
two of the residues, DD440-441, are functionally important for
virus output. When both of the aspartic acids were replaced by 
negatively charged glutamic acid residues, viruses with no new 
changes and a wild-type phenotype were recovered. Signifi- 
cantly, viruses were not recovered when both residues were 
replaced with positive charges. Replacement of both amino 
acids with neutrally charged alanine or individual replacement 
of either residue with positively charged arginine was toler- 
ated, but the vast majority of the recovered viruses also had 
compensating changes which restored the wild-type pheno- 
type. A few very crippled viruses were recovered that retained 
the single charge reversal substitution at position 441 without 
additional new changes. Altogether, the results indicate that
the negatively charged residues at positions 440 and 441 are key residues.
The data support the idea that the residues are involved in 
virus assembly. 

Maintenance of the overall negative charge within the carboxy
end of the N protein appears to be important, since most of the
compensating changes are predicted to impact the overall charge
of the domain. The most prominent compensating change isolated
was the replacement of R425 with glycine when either D440 or
D441 was changed to a positively charged residue. In addition to
elimination of the positive charge at position 425, several
recovered viruses from the D441R mutant had new changes in which
amino acids were replaced by negative charges (N422D and A436D).
All of the recovered viruses with the DD440-441AA change had
compensating changes, with S424 and R425 each replaced by
glycine. Reconstruction of the N double mutants with both the
original and new residues that were present in the recovered
viruses confirmed that the additional changes provide a growth
advantage. This was specifically illustrated with the few D441R
mutants without compensating changes that were recovered,
compared with the D441R recovered viruses that also had the
R425G or A436D change that clearly suppressed the defect seen
with the parental mutant virus. This apparent gain of function
observed with the compensatory changes supports our conclusion
that the overall balance of negative charges in this region is
important for the function(s) of the protein. Furthermore,
analysis of the predicted pI of domain III and its acidic domain
for the wild type, mutants, and recovered compensated mutant
viruses supports the conclusion. Replacement of DD440-441 with
positive charges had the greatest negative impact on the virus,
since no viruses were recovered for this mutant. Replacement of
these key residues, either singly or together, is predicted to
result in an increase in the pI of domain III and, more
dramatically, its acidic domain. In the most frequently recovered
viruses with compensating changes, the overall charge of domain
III is predicted to be less positive, closer to that of the
wild-type protein. This is consistent with our conclusion that
DD440-441 are important in helping to maintain the overall
negative charge of the domain.
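
The charge-balance argument can also be illustrated with a formal-charge tally of domain III. This is a deliberately crude sketch: it counts D and E as -1 and K and R as +1, and it ignores the termini, histidine, and phosphorylation, all of which are simplifying assumptions. The sequence is taken from footnote b of Table 2, and the helper names are hypothetical.

```python
# Domain III (residues 406 to 454) as transcribed from footnote b of Table 2.
DOMAIN_III = "FVDNVSVAKPKSSVQRNVSRELTPEDRSLLAQILDDGVVPDGLEDDSNV"
FIRST_RESIDUE = 406

# ASSUMPTION: formal side-chain charges near neutral pH only.
FORMAL_CHARGE = {"D": -1, "E": -1, "K": 1, "R": 1}


def substitute(seq, changes):
    """Apply {position: new_residue} substitutions (N protein numbering)."""
    residues = list(seq)
    for position, new_residue in changes.items():
        residues[position - FIRST_RESIDUE] = new_residue
    return "".join(residues)


def formal_net_charge(seq):
    """Sum of formal side-chain charges; uncharged residues count as 0."""
    return sum(FORMAL_CHARGE.get(aa, 0) for aa in seq)


VARIANTS = {
    "WT": {},
    "D441R": {441: "R"},
    "R425G D441R": {425: "G", 441: "R"},
    "A436D D441R": {436: "D", 441: "R"},
    "DD440-441RR": {440: "R", 441: "R"},
    "DD440-441AA": {440: "A", 441: "A"},
    "SR424-425GG DD440-441AA": {424: "G", 425: "G", 440: "A", 441: "A"},
    "DD440-441EE": {440: "E", 441: "E"},
}

for name, changes in VARIANTS.items():
    charge = formal_net_charge(substitute(DOMAIN_III, changes))
    print(f"{name:26s} {charge:+d}")
```

Under these assumptions the wild-type domain tallies to -5, the lethal DD440-441RR mutant to -1, and the D441R and DD440-441AA mutants to -3. Each compensating change listed here moves the tally back to -4, one step closer to the wild type.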

There are several possibilities that may account for the im- 
portance of the negative charges in domain III of the multi- 
functional N protein. Data from both biochemical and genetic 
analyses have clearly established that the M protein interacts 
with viral nucleocapsids (13, 22, 29). A number of other po- 
tential functions are more speculative or less well defined. N 
protein molecules likely interact with each other in the helical 
nucleocapsid structure. There seems to be some variation 
among coronaviruses, but N molecules are thought to homo- 
oligomerize. MHV-CoV N is thought to form trimers, whereas 
recent data on SARS-CoV suggest that N forms dimers and 
tetramers (18, 28, 38a, 51). Recently, evidence for a role of the 
N protein in viral transcription and/or replication was provided 
(40). It has also been suggested that SARS N may impact cell 
signaling pathways (17). Since charged residues often mediate 
protein-protein interactions, the conserved negative charges 
may participate in important specific N-protein (N-M and 
N-N, other N-virus, or N-host) interactions that are involved in 
any of the roles that N plays during the virus life cycle. The 
DD440-441 residues conceivably could participate in
electrostatic interactions involved in any of these functions.

While the reviews of our work were being addressed, results 
from a study similar to ours were published (20). A series of 
clustered charge-to-alanine mutations were introduced by tar- 
geted RNA recombination into the carboxy end of the MHV- 
CoV N protein in the study. Aspartic acids 440 and 441 were 
also identified in the study as important key residues in the 
protein. Interestingly, viruses were recovered with compensat- 
ing changes not only in the N gene, as we found, but also within 
the M gene. Roughly one-third of the recovered viruses had
compensating changes in M. As with our study, compensating
changes in the N gene mapped within domain III. The major
compensating change within N was replacement of glutamine
437 by leucine. This change would not result in a difference in
the overall charge of the domain compared with the parental
DD440-441AA mutant. It was hypothesized that the Q437L
substitution might result in replacement of a lost electrostatic
interaction with a hydrophobic interaction between the carboxy
ends of the N and M proteins. Evidence from the study
strongly suggests that D440 and D441 are involved in mediating
N-M interactions during virus assembly. Overall, the results of
that study and ours are complementary, since both independently
identified the same key residues using different
reverse-genetic approaches. Each study also provides additional
information that gives new insight into an obviously important
functional domain in the multifunctional N protein.

We entered our study with the assumption that electrostatic 
charge interactions likely occur between the carboxy ends of 
the N and M proteins during virus assembly. This assumption 
was based in part on the results of an earlier study in which 
deletion of the penultimate positive charge at the COOH end 
of the MHV M protein resulted in recovery of viruses with 
additional changes in both the M and N proteins that could 
compensate for the charge loss (22). We expected that if any of 
our charge substitution mutations impacted the M-nucleocap- 
sid interaction, and if the compensatory changes could restore 
the function, some of the additional changes would map within 
the M protein. It is clear from the recently published study by
Hurst et al. that changes within M can compensate for DD440-441
mutations. It is not clear why all of the compensating changes
that we identified mapped within the N gene and why a greater
number of compensatory changes were identified. Nonetheless,
our results demonstrate that the loss of the negative charges at
amino acid positions 440 and 441 can be compensated for by
several changes that map within 15 to 20 residues amino terminal
to the residues in the N protein alone.
Our compensating changes in N arose quickly following elec- 
troporation of the full-length infectious cloned RNA, indicat- 
ing that removal of the negative charges places significant 
pressure on the virus for selection of more fit, viable viruses. 
Once the new changes arose within N, the virus was apparently 
no longer under pressure for additional changes to arise else- 
where within the genome that might alternatively compensate 
for the lethal or crippling effects of the charge reversals at 
positions 440 and 441. 

Our DD440-441 charge substitution mutants clearly had an
impact on virus growth and output. Replacement of both residues
with positive charges was apparently lethal to the virus, since
we were not able to recover virus following electroporation. The
few D441R viruses that were isolated with only the introduced
single substitution all exhibited a very crippled phenotype,
giving rise to very small plaques and titers that were
significantly below that of the wild-type virus. The virus
output is lower than would be expected based on the amount of
intracellular viral protein synthesized by the mutant. This
strongly indicates that virus assembly is compromised when the
negatively charged DD440-441 residues are removed.

Further studies will be necessary to determine whether the 
DD440-441 residues are involved in transcription and/or
replication in addition to virus assembly. The reduction in D441R
virus output compared with the wild type and the compensated
parent virus could account for the reduced total viral
subgenomic RNA of the mutant. However, we cannot exclude at
this point the possibility that the D441R mutation has a direct
effect on whatever role N plays in viral RNA synthesis. There
is precedent for a multifunctional role of the nucleocapsid 
proteins of other viruses. The N proteins of negative-stranded 
RNA viruses, in addition to functioning as structural compo- 
nents, are also thought to play roles in transcription, replica- 
tion, and intracellular trafficking of viral genomes. For exam- 
ple, the N protein of vesicular stomatitis virus encapsidates the 
genomic RNA as a helical ribonucleoprotein structure, but it 
has also been demonstrated in many studies that the protein is 
involved in RNA replication (47). A new model for vesicular 
stomatitis virus transcription and replication was recently de- 
scribed in which the RNA polymerase exists as part of two 
distinct complexes in virus-infected cells (38). The transcrip- 
tase complex consists of the polymerase (L), phosphoprotein 
(P), and host proteins, while the replicase complex consists of 
the L, P, and N proteins. The N protein also obviously func- 
tions as a structural component of virions, since it encapsidates 
the genomic RNA. The influenza nucleoprotein is another 
example of a structural protein that encapsidates the virus 
genome. Results from many studies suggest that the NP protein
is also involved in viral RNA synthesis as part of the
ribonucleoprotein particle (37). How easy it will be to uncouple
virus assembly and RNA synthesis in coronavirus-infected cells 
remains to be determined. 

The concentration of primarily negative charges in domain 
III of coronavirus N proteins is a conserved feature across the 
family. However, the inability of the domains to be exchanged 
between two closely related group II coronaviruses suggests 
that more than just the charged residues are important (36). 
Since we identified compensating changes only within the N 
protein itself, this suggests that, in addition to maintaining the 
net negative charge, the overall structure of the carboxy-ter- 
minal region of the protein may be important in allowing the 
protein, more specifically domain III, to play its functional 
role(s) in N-N, N-M, or other N-protein interactions. The 
frequency with which R425 was replaced with glycine when the
single positive charges were introduced at position 440 or 441, 
as well as the additional replacement of the adjacent amino 
acid 424 in the case where both were mutated, is consistent 
with the idea that the structural presentation of the negative 
charges is likely important. The small side chain on glycine 
promotes conformational freedom and allows greater flexibil- 
ity in the surrounding areas, especially when multiple glycines 
are in close proximity to each other (32). Since structural 
information on the N protein is limited, definitive assessment 
of the importance and requirements for presentation of the 
negative charges in the context of the whole molecule must 
await the availability of this information. A series of protease 
digestions and mass spectrometry identification of peptides 
released over a time course suggest that the acidic domain is 
exposed on the surface (White and Hogue, unpublished). Recently,
the three-dimensional structures of the N termini of both
SARS-CoV N (residues 45 to 181) and infectious bronchitis virus
(IBV) N (residues 29 to 160) were determined by nuclear magnetic
resonance and X-ray crystallography, respectively (14, 19).
In the case of IBV N, it was proposed that the carboxy end,
excluding the extreme end of the domain where negatively
charged residues are concentrated, is important for
oligomerization of the protein (14). Clearly, further structural
studies are needed to better define the conformational and
dynamic changes within the carboxy-terminal negatively charged
domain of this multifunctional protein that help drive virion
assembly, and to assess the possible involvement of the domain
in other N-protein interactions that underlie the other
functional roles of the protein.